Anecdotes???
Dublin Data Science
Insurely You’re Joking (Dublin|London)
Anyone who will have me
Talks
Workshops
Informal help
Reproducibility
Track changes
Collaboration
Issue tracking
Branch management
What is research?
Outcome unknown…
Try lots of stuff…
Record of work
Jupyter vs Zeppelin vs Rmarkdown
NOT for production
make and Makefiles
Directed Acyclic Graph (DAG)
sysadmin tasks
IMAGE_TAG=${PROJECT_USER}/${PROJECT_NAME}
CONTAINER_NAME=repro_research
render-html: ${PROJECT_NAME}.Rmd
Rscript -e 'rmarkdown::render("${PROJECT_NAME}.Rmd")'
docker-build-image: Dockerfile
docker build -t ${IMAGE_TAG} -f Dockerfile .
docker-run:
docker run --rm -d \
-p 8787:8787 \
-v "${PWD}":"/home/${DOCKER_USER}/${PROJECT_NAME}":rw \
-e USER=${DOCKER_USER} \
-e PASSWORD=quickpass \
--name ${CONTAINER_NAME} \
${IMAGE_TAG}
Quitting from lines 272-288 (10_carinspricing_exploration.Rmd)
Error in `[.tbl_df`(policyprop_dt, claim_count > 0) :
object 'claim_count' not found
Calls: <Anonymous> ... ggplot -> [ -> [.grouped_df -> NextMethod -> [.tbl_df
Execution halted
Lightweight containers
Library versioning
FROM rocker/verse:4.0.0
RUN apt-get update \
&& apt-get install -y --no-install-recommends \
byobu \
&& apt-get clean \
&& install2.r --error \
conflicted \
cowplot \
knitr \
revealjs \
snakecase
Issues
APIs (Yahoo! Finance)
\
Large datafiles
A Cautionary Tale…
TJLUYOM
https://web.archive.org/web/20180924182907/http://hginit.com/
https://www.dataquest.io/blog/jupyter-notebook-tutorial/
https://zeppelin.apache.org/docs/0.5.5-incubating/
http://matt.might.net/articles/intro-to-make/
https://edoras.sdsu.edu/doc/make.html
https://www.gnu.org/software/make/manual/html_node/index.html
http://ropenscilabs.github.io/r-docker-tutorial/
https://docker-curriculum.com/
https://gitlab.com/ecohealthalliance/drake-gitlab-docker-example
Email:
GitHub: